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Abstract 

We study the complexity of approximating the vertex expansion of graphs G = (V, E), defined as 

A/def . \N(S)\ 

d> = minw ■ . 

T scv \S\\V\S\ 



We give a simple polynomial-time algorithm for finding a subset with vertex expansion 0( ^JOPJ\ogd) 
where d is the maximum degree of the graph. Our main result is an asymptotically matching lower bound: 
under the Small Set Expansion (SSE ) hypothesis, it is hard to find a subset with expansion less than 
C -y/OPTlog<i for an absolute constant C. In particular, this implies for all constant s > 0, it is SSE-hard 
to distinguish whether the vertex expansion < s or at least an absolute constant. The analogous threshold 
for edge expansion is VOPT with no dependence on the degree; thus our results suggest that vertex 
expansion is harder to approximate than edge expansion. In particular, while Cheeger's algorithm can 
certify constant edge expansion, it is SSE -hard to certify constant vertex expansion in graphs. 

Our proof is via a reduction from the Unique Games instance obtained from the SSE hypothesis to the 
vertex expansion problem. It involves the definition of a smoother intermediate problem we call Balanced 
Analytic Vertex Expansion which is representative of both the vertex expansion and the conductance 
of the graph. Both reductions (from the UGC instance to this problem and from this problem to vertex 
expansion) use novel proof ideas. 
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1 Introduction 

Vertex expansion is an important parameter associated with a graph, one that has played a major role in both 
algorithms and complexity. Given a graph G = (V,E), the vertex expansion of a set S c V of vertices is 
defined as 

v « m .jw 

^ v ' ' ' |5||V\5| 

Here Af(S) denotes the outer boundary of the set S , i.e. N(S) = {/ e V\S|3w € S such that {«, v) e £■}. The 

vertex expansion of the graph is given by V = mins c y V (>S). The problem of computing V is a major 
primitive for many graph algorithms specifically for those that are based on the divide and conquer paradigm 
[LR99] . It is NP-hard to compute the vertex expansion V of a graph exactly. In this work, we study the 
approximability of vertex expansion V of a graph. 

A closely related notion to vertex expansion is that of edge expansion. The edge expansion of a set S is 
defined as 

<f>(S) = 77TT 

fx(S) 

and the edge expansion of the graph is <p = minscy 0(S). Graph expansion problems have received much 
attention over the past decades, with applications to many algorithmic problems, to the construction of 
pseudorandom objects and more recenlty due to their connection to the unique games conjecture. 

The problem of approximating edge or vertex expansion can be studied at various regimes of parameters 
of interest. Perhaps the simplest possible version of the problem is to distinguish whether a given graph is an 
expander. Fix an absolute constant 6q. A graph is a So- vertex (edge) expander if its vertex (edge) expansion is 
at least So- The problem of recognizing a vertex expander can be stated as follows: 

Definition 1.1. Given a graph G, distinguish between the following two cases 

- (Non-Expander) the vertex expansion is < s 

- (Expander) the vertex expansion is > So for some absolute constant Sq. 
Similarly, one can define the problem of recognizing an edge expander graph. 

Notice that if there is some sufficiently small absolute constant e (depending on So), for which the above 
problem is easy, then we could argue that it is easy to "recognize" a vertex expander. For the edge case, the 
Cheeger's inequality yields an algorithm to recognize an edge expander. In fact, it is possible to distinguish 
a So edge expander graph, from a graph whose edge expansion is < Sq/2, by just computing the second 
eigenvalue of the graph Laplacian. 

It is natural to ask if there is an efficient algorithm with an analogous guarantee for vertex expansion. 
More precisely, is there some sufficiently small s (an arbitrary function of So), so that one can efficiently 
distinguish between a graph with vertex expansion > So from one with vertex expansion < e. In this work, 
we show a hardness result suggesting that there is no efficient algorithm to recognize vertex expanders. More 
precisely, our main result is a hardness for the problem of approximating vertex expansion in graphs of 
bounded degree d. The hardness result shows that the approximability of vertex expansion degrades with the 
degree, and therefore the problem of recognizing expanders is hard for sufficiently large degree. Furthermore, 
we exhibit an approximation algorithm for vertex expansion whose guarantee matches the hardness result up 
to constant factors. 



Related Work. The first approximation for conductance was obtained by discrete analogues of the Cheeger 
inequality shown by Alon-Milman [AM85] and Alon [AI086]. Specifically, Cheeger's inequality relates the 
conductance O to the second eigenvalue of the adjacency matrix of the graph - an efficiently computable 
quantity. This yields an approximation algorithm for O, one that is used heavily in practice for graph 
partitioning. However, the approximation for O obtained via Cheeger's inequality is poor in terms of a 
approximation ratio, especially when the value of O is small. An 0(\ogn) approximation algorithm for O 
was obtained by Leighton and Rao [LR99]. Later work by Linial et al. [LLR95] and Aumann and Rabani 
[AR98] established a strong connection between the Sparsest Cut problem and the theory of metric spaces, 
in turn spurring a large and rich body of literature. The current best algorithm for the problem is an 0( ^J\ogn) 
approximation for due to Arora et al. [ARV04] using semidefinite programming techniques. 

Ambiihl, Mastrolilli and Svensson [AMS07] showed that V and <p have no PTAS assuming that SAT does 
not have sub-exponential time algorithms. The current best approximation factor for V is ( ^logn log log n) 
obtained using a convex relaxation [FHL08]. Beyond this, the situation is much less clear for the approxima- 
bility of vertex expansion. Applying Cheeger's method leads to a bound of 0{ V^OPT) [AI086] where d is 
the maximum degree of the input graph. 

Small Set Expansion Hypothesis. A more refined measure of the edge expansion of a graph is its expansion 
profile. Specifically, for a graph G the expansion profile is given by the curve 

0(5)- min (p(S) V8e [0,1/2]. 

The problem of approximating the expansion profile has received much less attention, and is seemingly far 
less tractable. In summary, the current state-of-the-art algorithms for approximating the expansion profile of 
a graph are still far from satisfactory. Specifically, the following hypothesis is consistent with the known 
algorithms for approximating expansion profile. 

Hypothesis (Small-Set Expansion Hypothesis, [RS10]). For every constant n > 0, there exists sufficiently 
small 8 > such that given a graph G it is NP-hard to distinguish the cases, 

Yes: there exists a vertex set S with volume p(S ) - 8 and expansion cf)(S ) < n, 

No: all vertex sets S with volume p(S) - 8 have expansion c/>(S) > 1 — 77. 

Apart from being a natural optimization problem, the Small-Set Expansion problem is closely tied to 
the Unique Games Conjecture. Recent work by Raghavendra-Steurer [RS10] established reduction from the 
Small-Set Expansion problem to the well known Unique Games problem, thereby showing that Small-Set 
Expansion Hypothesis implies the Unique Games Conjecture. This result suggests that the problem of 
approximating expansion of small sets lies at the combinatorial heart of the Unique Games problem. 

In a breakthrough work, Arora, Barak, and Steurer [ABS 10] showed that the problem Small-Set Expansion^, 8) 
admits a subexponential algorithm, namely an algorithm that runs in time exp(n' ? /5). However, such an 
algorithm does not refute the hypothesis that the problem Small-Set Expansion^, 8) might be hard for every 
constant i] > and sufficiently small 8 > 0. 

The Unique Games Conjecture is not known to imply hardness results for problems closely tied to graph 
expansion such as Balanced Separator. The reason being that the hard instances of these problems are 
required to have certain global structure namely expansion. Gadget reductions from a unique games instance 
preserve the global properties of the unique games instance such as lack of expansion. Therefore, showing 
hardness for graph expansion problems often required a stronger version of the Expanding Unique Games, 



where the instance is guaranteed to have good expansion. To this end, several such variants of the conjecture 
for expanding graphs have been defined in literature, some of which turned out to be false [AKK + 08]. 
The Small-Set Expansion Hypothesis could possibly serve as a natural unified assumption that yields all 
the implications of expanding unique games and, in addition, also hardness results for other fundamental 
problems such as Balanced Separator. In fact, Raghavendra, Steurer and Tulsiani [RST12] show that the the 
SSE hypothesis implies that the Cheeger's algorithm yields the best approximation for the balanced separator 
problem. 

Formal Statement of Results. Our first result is a simple polynomial-time algorithm to obtain a subset 
of vertices S whose vertex expansion is at most 0( -y/OPTlog d). Here d is the largest vertex degree of G. 
The algorithm is based on a Poincaire-type graph parameter called Aco defined by Bobkov, Houdre and Tetali 
[BHTOO], which approximates cp y . While A^ also appears to be hard to compute, its natural SDP relaxation 
gives a bound that is within 0(logd), as observed by Steurer and Tetali [ST12], which inspires our first 
Theorem. 

Theorem 1.2. There exists a polynomial time algorithm which given a graph G — (V, E) having vertex 
degrees at most d, outputs a set S c V, such that (p^(S) = 0( ^ t fOPT\ogd). 

It is natural to ask if one can prove better inapproximability results for vertex expansion than those that 
follow from the inapproximability results for edge expansion. Indeed, the best one could hope for would be a 
lower bound matching the upper bound in the above theorem. Our main result is a reduction from SSE to the 
problem of distinguishing between the case when vertex expansion of the graph is at most e and the case 
when the vertex expansion is at least Q.( ^Jelogd). This immediately implies that it is SSE -hard to find a 
subset of vertex expansion less than C -y/OPTlog<i for some constant C. To the best of our knowledge, our 
work is the first evidence that vertex expansion might be harder to approximate than edge expansion. More 
formally, we state our main theorem below. 

Theorem 1.3. For every e, r\ > 0, there exists an absolute constant C such that it is SSE -hard to distinguish 
between the following two cases for a given graph G with maximum degree d. 

Yes / There exists a set S c V of size \S\ = \V\/2 such that 

No ; For all sets S c V, 

cf> v (S ) > min(lCr 10 , C ^slogd) - n 

By a suitable choice of parameters in the above theorem, we obtain the main theorem of this work, 
Theorem 1.3. 

Theorem 1.4. There exists an absolute constant So > such that for every constant e > the following 
holds: Given a graph G, it is SSE -hard to distinguish between the following two cases: 

Yes ; There exists a set S c V of size \S\ = \V\/2 such that (p^iS) < e 

No : (G is a vertex expander with constant expansion) For all sets S c V, V (5) ^ So 



In particular, the above result implies that it is SSE -hard to certify that a graph is a vertex expander with 
constant expansion. This is in contrast to the case of edge expansion, where the Cheeger's inequality can be 
used to certify that a graph has constant edge expansion. 

At the risk of being redundant, we note that our main theorem implies that any algorithm that outputs a set 
having vertex expansion less than C -y/OPTlog<i will disprove the SSE hypothesis; alternatively, to improve 
on the bound of 0( -y/OPTlog<i), one has to disprove the SSE hypothesis. From an algorithmic standpoint, 
we believe that Theorem 1 .4 exposes a clean algorithmic challenge of recognizing a vertex expander - a 
challenging problem that is not only interesting on its own right, but whose resolution would probably lead to 
a significant advance in approximation algorithms. 

At a high level, the proof is as follows. We introduce the notion of Balanced Analytic Vertex Expansion 
for Markov chains. This quantity can be thought of as a CSP on (d+ l)-tuples of vertices. We show a reduction 
from Balanced Analytic Vertex Expansion of a Markov chain, say H, to vertex expansion of a graph, say 
Hi (Section 7). Our reduction is generic and works for any Markov chain H. Surprisingly, the CSP-like 
nature of Balanced Analytic Vertex Expansion makes it amenable to a reduction from Small-Set Expansion 
(Section 6). We construct a gadget for this reduction and study its embedding into the Gaussian graph to 
analyze its soundness (Section 4 and Section 5). The gadget involves a sampling procedure to generate a 
bounded-degree graph. 

2 Proof Overview 

Balanced Analytic Vertex Expansion. To exhibit a hardness result, we begin by defining a combinatorial 
optimization problem related to the problem of approximating vertex expansion in graphs having largest 
degree d. This problem referred to as Balanced Analytic Vertex Expansion can be motivated as follows. 

Fix a graph G = (V, E) and a subset of vertices S c V. For any vertex v e V, v is on the boundary 
of the set S if and only if max Me jv(u) |Ij [u] -1$ [v]\ = 1, where N(v) denotes the neighbourhood of vertex 
v. In particular, the fraction of vertices on the boundary of S is given by E„ max„ e yv(„) \lg [u] - I5 [v]\. The 
symmetric vertex expansion of the set S c V is given by, 

\N(S)UN(V\S)\ _ B v ma^ umv) \I s [u]-I s [v]\ 
\S\\V\S\ " E^IIsM-IsMI 

Note that for a degree d graph, each of the terms in the numerator is maximization over the d edges incident 
at the vertex. The formal definition of Balanced Analytic Vertex Expansion is as shown below. 

Definition 2.1. An instance of Balanced Analytic Vertex Expansion, denoted by (V, P), consists of a set 
of variables V and a probability distribution P over (d + l)-tuples in V d+l . The probability distribution P 
satisfies the condition that all its d + 1 marginal distributions are the same (denoted by ju). The goal is to solve 
the following optimization problem 

def EfY y, ya~p max.- \F(Yi) - F(X)\ 
0(V,P)= min ( ' '-"^ , ' 

F:V^{0,l]\Ex,T~ ll \F(X)-F(Y)\>^ 5 ^X,Y~ M \F(X) - F(Y)\ 

For constant d, this could be thought of as a constraint satisfaction problem (CSP) of arity d + 1 . Every d- 
regular graph G has an associated instance of Balanced Analytic Vertex Expansion whose value corresponds 
to the vertex expansion of G. Conversly, we exhibit a reduction from Balanced Analytic Vertex Expansion 
to problem of approximating vertex expansion in a graph of degree poly(<i) (Section 7 for details). 
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Figure 1: Reduction from SSE to Vertex Expansion 



Dictatorship Testing Gadget. As with most hardness results obtained via the label cover or the unique 
games problem, central to our reduction is an appropriate dictatorship testing gadget. 

Simply put, a dictatorship testing gadget for Balanced Analytic Vertex Expansion is an instance ( H R of 
the problem such that, on one hand there exists the so-called dictator assignments with value £, while every 
assignment far from every dictator incurs a cost of at least 0( -y/elogii). 

The construction of the dictatorship testing gadget is as follows. Let H be a Markov chain on vertices 
{s, t, f,s'} connected to form a path of length three. The transition probabilities of the Markov chain 
*H are so chosen to ensure that if yu# is the stationary distribution of H then /i#(?) = ////(/') = e/2 and 
Hh{s) - Hh{s') = (1 - e)/2. In particular, H has a vertex separator {t, t'} whose weight under the stationary 
distribution is only e. 

The dictatorship testing gadget is over the product Markov chain H R for some large constant R. The 
constraints P of the dictatorship testing gadget H R are given by the following sampling procedure, 

- Sample x e H R from the stationary distribution of the chain. 

- Sample J-neighbours y\,...,yd £ H R of x independently from the transition probabilities of the chain 
H R . Output the tuple (x,y\,..., yd)- 

For every i e [R], the i th dictator solution to the above described gadget is given by the following function, 

fl if x t e{s,t} 
10 otherwise 

It is easy to see that for each constraint (x, y\, . . . , yd) ~ P, maxy \F(x) - F(yj)\ = unless Xj - t or x, = t'. 
Since x is sampled from the stationary distribution for /jh, %i £ U, f} happens with probability e. Therefore 
the expected cost incurred by the i th dictator assignment is at most s. 

Soundness Analysis of the Gadget. The soundness property desired of the dictatorship testing gadget can 
be stated in terms of influences. Specifically, given an assignment F : V(H) R —> [0,1], the influence of the i ,h 
coordinate is given by Inf^F] = E X[R]V Var Xi [F(x)], i.e., the expected variance of the function after fixing all 
but the i th coordinate randomly. Henceforth, we will refer to a function F : H R — > [0, 1] as far from every 
dictator if the influence of all of its coordinates are small (say < r). 

We show that the dictatorship testing gadget H R described above satisfies the following soundness - for 
every function F that is far from every dictator, the cost of F is at least Q{ -^slogd). To this end, we appeal 
to the invariance principle to translate the cost incurred to a corresponding isoperimetric problem on the 
Gaussian space. More precisely, given a function F : H R — > [0, 1], we express it as a polynomial in the 
eigenfunctions over H. We carefully construct a Gaussian ensemble with the same moments up to order two, 
as the eigenfunctions at the query points (x, y\, . . . , yd) € P. By appealing to the invariance principle for low 
degree polynomials, this translates in to the following isoperimetric question over Gaussian space Q., 

Suppose we have a subset S QQ of the ra-dimensional Gaussian space. Consider the following experiment: 

- Sample a point z e § the Gaussian space. 

- Pick d independent perturbations z\ , z' 2 , ■ ■ ■ , z' d of the point z by £-noise. 

- Output 1 if at least one of the edges (z, z-) crosses the cut (S, S) of the Gaussian space. 



Among all subsets 5 of the Gaussian space with a given volume, which set has the least expected output 
in the above experiment? The answer to this isoperimetric question corresponds to the soundness of the 
dictatorship test. A halfspace of volume ^ has an expected output of yje log d in the above experiment. We 
show that among all subsets of constant volume, halfspaces acheive the least expected output value. 

This isoperimetric theorem proven in Section 4 yields the desired Q( yi log d) bound for the soundness 
of the dictatorship test constructed via the Markov chain H. Here the noise rate of e arises from the fact 
that all the eigenfunctions of the Markov chain H have an eigenvalue smaller than 1 - s. The details of the 
argument based on invariance principle is presented in Section 5 

We show a Q( yi log d) lower bound for the isoperimetric problem on the Gaussian space. The proof of 
this isoperimetric inequality is included in Section 4 

We would like to point out here that the traditional noisy cube gadget does not suffice for our application. 
This is because in the noisy cube gadget while the dictator solutions have an edge expansion of e they have a 
vertex expansion of sd, yielding a much worse value than the soundness. 

Reduction from Small-Set Expansion problem. Gadget reductions from the Unique Games problem 
cannot be used towards proving a hardness result for edge or vertex expansion problems. This is because if 
the underlying instance of Unique Games has a small vertex separator, then the graph produced via a gadget 
reduction would also have small vertex expansion. Therefore, we appeal to a reduction from the Small-Set 
Expansion problem (Section 6 for details). 

Raghavendra et al. [RST12] show optimal inapproximability results for the Balanced separator problem 
using a reduction from the Small-Set Expansion problem. While the overall approach of our reduction is 
similar to theirs, the details are subtle. Unlike hardness reductions from unique games, the reductions for 
expansion-type problems starting from Small-Set Expansion are not very well understood. For instance, the 
work of Raghavendra and Tan [RT12] gives a dictatorship testing gadget for the Max-Bisection problem, but 
a Small-Set Expansion based hardness for Max-Bisection still remains open. 

2.1 Notation 

We use hq to denote a probability distribution on vertices of the graph G. We drop the subscript G, when the 
graph is clear from the context. For a set of vertices S , we define /i(S) = f n{x). We use fi\s to denote the 
distribution fi restricted to the set 5 c V(G). If the graph has vertex weights, we denote the weight of a vertex 
v € V(G) by wiv) and denote the weight of a set S QV by w(S ). We denote the degree of a vertex v by deg(u). 
We denote the neighborhood of S in G by Ng(S), i.e. 

N G (S) = {v € S\3u € S such that {u, v\ e E(G)} . 

We drop the subscript G when the graph is clear from the context. 

2.2 Organization 

We begin with some definitions and the statements of the SSE hypotheses in Section 3. In Section A, we 
show that the computation of vertex expansion and symmetric vertex expansion is equivalent upto constant 
factors. We prove a new Gaussian isoperimetry results in Section 4 that we use in our soundness analysis. In 
Section 5 we show the construction of our main gadget and analyze its soundness and completeness using 
Balanced Analytic Vertex Expansion as the test function. We show a reduction from a reduction from 
Balanced Analytic Vertex Expansion to vertex expansion in Section 7. In Section 6, we use this gadget to 



show a reduction SSE to Balanced Analytic Vertex Expansion. Finally, in Section 8, we show how to put all 
the reductions togethor to get optimal SSE -hardness for vertex expansion. 

Complimenting our lower bound, we give an algorithm that outputs a set having vertex expansion at most 
0( yOPTlogfif) in Section 9. 

3 Preliminaries 

Symmetric Vertex Expansion. For our proofs, the notion of Symmetric Vertex Expansion is useful. 

Definition 3.1. Given a graph G = (V, E), we define the the symmetric vertex expansion of a set S c V as 

follows. 

^v^def \N G (S)UN G (V\S)\ 
®o(S) = n- m ^ l 

Balanced Vertex Expansion. We define the balanced vertex expansion of a graph as follows. 

Definition 3.2. Given a graph G and balance parameter b, we define the b-balanced vertex expansion of G as 
follows. 

,V,bal def . •V/c\ 

<p h = mm f(S). 

S:\S\\V\S\>bn 2 

and 

0^ ba|d ^ f min O v (S). 

S:\S\\V\S\>bn 2 



We define <^ bal t f £** and <D V ' bal d = oY;^ 



1/100 1/100" 

Analytic Vertex Expansion. Our reduction from SSE to vertex expansion goes via an intermediate problem 
that we call J-Balanced Analytic Vertex Expansion. We define the notion of ^-Balanced Analytic Vertex 
Expansion as follows. 

Definition 3.3. An instance of J-Balanced Analytic Vertex Expansion, denoted by (V, P), consists of a set 
of variables V and a probability distribution P over (d + l)-tuples in V d+l . The probability distribution P 
satisfies the condition that all its d + 1 marginal distributions are the same (denoted by /u). The ^-Balanced 
Analytic Vertex Expansion under a function F : V h» {0, 1} is defined as 

def E(x,r 1 ,...,y t/ )~f>max; \F(Yj) - F(X)\ 
VKV.rw) te X j^\F(X)-F(Y)\ 

The J-Balanced Analytic Vertex Expansion of (V, P) is defined as 

®(V,P) d = min 0(V,P)(F). 

F:V^{0,l}\W, x , Y ^\F(X)-F(Y)\>-^ 

When drop the degree d from the notation, when it is clear from the context. 

For an instance (V, P) of Balanced Analytic Vertex Expansion and an assignment F : V \-* {0, 1} define 

valp(F) - E max \F(Y { ) - F(X)\ . 

(X,Yi,...,Y d )~P i 



Gaussian Graph. Recall that two standard normal random variables X, Y are said to be or-correlated if 
there exists an independent standard normal random variable Z such that Y - aX + Vl - a 2 Z. 

Definition 3.4. The Gaussian Graph 0\,i. is a complete weighted graph on the vertex set V(@az) = R". 
The weight of the edge between two vertices u, v e V(@ax) is given by 

w({u, v}) = P [X = u and Y = v] 

where Y ~ N(AX, E), where A is a diagonal matrix such that ||A|| < 1 and S > si is a diagonal matrix. 

Remark 3.5. Note that for any two non-empty disjoint sets S i, S2 c V(@a,i.), the total weight of the edges 
between 5 1 and S 2 can be non-zero even though every single edge in the Qax has weight zero. 

Definition 3.6. We say that a graph G = (V, E) is @(<i)-regular, if there exist absolute constants c\,C2 e R + 
such that for all vertices i e V, we have ci<i < deg(/) < C2d. 

For a random variable X, define the variance and t\ -variance as follows, 

Var[X]= E[(A- 1 -X 2 ) 2 ] Var^X] = E[|Xi-X 2 |] 

where X\ , X2 are two independent samples of X. 

Small-Set Expansion Hypothesis. 

Problem 3.7 (Small-Set Expansion (y, 5)). Given a regular graph G = (V,E), distinguish between the 
following two cases: 

Yes: There exists a non-expanding set 5 c V with ju(S) = 5 and ®g(S) < y. 

No: All sets 5 c V with //(S) = 5 are highly expanding having ®g(S) > 1 - y- 

Hypothesis 3.8 (Hardness of approximating Small-Set Expansion). For all y > 0, there exists 8 > swc/z 
?/ja? ?/ze promise problem Small-Set Expansion (y, 6) is HP -hard. 

For the proofs, it shall be more convenient to use the following version of the Small-Set Expansion 
problem, in which we high expansion is guaranteed not only for sets of measure 5, but also within an arbitrary 
multiplicative factor of 5. 

Problem 3.9 (Small-Set Expansion (y, 5, M)). Given a regular graph G = (V,E), distinguish between the 
following two cases: 

Yes: There exists a non-expanding set S c V with /u(S) = 6 and ®g(S) < y. 

No: All sets S c V with p(S) e (£, M§) have O g (5) > 1 - y. 

The following stronger hypothesis was shown to be equivalent to Small-Set Expansion Hypothesis in 
[RST12]. 

Hypothesis 3.10 (Hardness of approximating Small-Set Expansion). For all y > and M > 1, f/zere exists 
6 > smc/z ?/zctf ?/ze promise problem Small-Set Expansion (y, 6, M) is HP-hard. 



4 Isoperimetry of the Gaussian Graph 

In this section we bound the Balanced Analytic Vertex Expansion of the Gaussian graph. For the Gaussian 
Graph, we define the canonical probability distribution on V d+[ as follows. The marginal distribution along 
any component X or 7/ is the standard Gaussian distribution in R", denoted here by /i = Af(0, 1)". 



W =1 w(X,Yi) 



Til 



P @AX (X, Y u ...,Y d )= '= = ai(X)II£i P [Y = Y t ] . 

H(X) a 

Here, random variable Y is sampled from N(AX, E). 

Theorem 4.1. For any closed set S c ofV{Qt^x) (^ - £ I is a diagonal matrix), we have 

%^% max ' |][ ^ [X] ~ ^ [F ' ]I _ E^ E yi ,...y rf ~MAXj:) max,- |I 5 [X] - l s [F,]| 
Ex,r^ Us [X] -I S [Y]\ " E X>F ^ |I 5 [X] -I S [Y]\ 

for some absolute constant c. 



> c yjs log d 



Lemma 4.2. Let u,v eW 1 satisfy \u — v\ < ye log <i a«<i /ef 1, be a diagonal matrix satisfying S > e/. Lef 
P u , P„ &e f/ie distributions N(Au, X) a«<i N(Av, S) respectively. Then, 

drv(P u ,P v )<l--. 

d 

Proof. First, we note that that for the purpose of estimating their total variation distance, we can view P u , P v 
as one-dimensional Gaussians along the line Am - Av. Since ||A|| < 1, 

|| Am - Av\\ < ||m - v\\ < ^elogd . 

Wlog, we may take Au - and Ay = ye log <i. Next, by the definition of total variation distance, 

djy{Pu,Pv) = \ \Pv(x) - P u (x)\dx 

Jx:P v {x)>P u (x) 

f*CO 

(P v (x) - P u (x))dx 

JAv/2 



i r°° fc^p , l r°° _m£ , 

I e 2 " dx I e 2e dx 

2ns JAv/2 y2ne JAv/2 

L_ f A 

2ns J-fi 



line JAv/2 ylTTE JAv/2 

e 2e ax 
/2^e J-Au/2 

1 fV^/2 _rf 
— - e i dx 



||2 



V2tt J-yiogrf/2 

i r°° 

= 1-2- — — e~^ dx 

y2n J Viog^/2 

1 
where the last step uses a standard bound on the Gaussian tail. 
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Proof of Theorem 4.1. Let fix denote the Gaussian distribution N(AX, E). Then the LHS is: 
f (l - (1 - fix(S)) d ) dfi{X) + f (l - (1 - A*x(R" \ S)) d ) dfi(X). 

JR"\S JS 

To bound this, we will restrict ourselves to points X for which the fix measure of the complementary set is at 
least l/d. Roughly speaking, these will be points near the boundary of S . Define: 

Si = jx€S : fix(R" \S) < ^\ , S 2 = U eH n \S : fi x (S) < -^ 

and 

5 3 = R"\5!\5 2 . 

For u € R", let P u be the distribution N(Au, X). For any u e S i,v e S2, we have 

J „ (P ,„ P „ )>1 _i__i_ = 1 _'. 

Therefore, by Lemma 4.2, \\u - v\\ > ^/e log d, i.e., d{S\,Si) > -^e log d. Next we bound the measure of 
53. We can assume wlog that fi(S) < fi(K n \ S) and fi(S\) > fi(S)/2 (else fi(Sj) > fi(S)/2 and we are done). 
Applying the isoperimetric inequaity for Gaussian space (see surveys [Vem05, VLM10]), for subsets at this 
distance, 

fi(S 3 )> ^^e\ogd-fi{S l )fi{S 2 )> ^^^-■fi(SMR n \S). 

We are now ready to complete the proof. 



> 2 



l (f (1 - (1 - fi x (S)) d ) dfi(X) + f (1 - (1 - Hx(R n \ S)) dfi(X)\ 

1 \Jr"\s Js I 

Uf {\-(l-fi x (S)) d )dfi(X)+ [ {\-(l-fi X (R n \S))dfi{X)\ 

1 \JXeR"\S,iix(S)»l/d JXeS,ii x (R"\S)>l/d I 

- ( f dfi{X) + f dfl{X)\ 

\JXeR"\SMx(S)>\/d JXeS,u x (R"\X)>l/d I 



2e \JXeR"\S,fi x (S)>Vd JXeS^xiR'^X^i/d 

e-\ 

> -^-KSi) 

> c^e\ogd-fi(S)fiCR n \S). 



We prove the following Theorem which helps us to bound the isoperimetry of the Gaussian graph for 
over all functions over the range [0, 1]. 

Theorem 4.3. Given an instance (V, f) and a function F : V t-* [0, 1], there exists a function F' : V 1— > {0, 1}, 

such that 

E(x,r 1 ,...,y,)~p max, \F(X) - F(Yj)\ > ^ {X j,,...,Y d) ~p max,- \F'{X) - F'(Yj)\ 

^x,y~u \F(X) - F(Y)\ " Bx,y~u \F'(X) - F'(Y)\ 
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Proof. For every r e [0, 1], we define F r : V \-> {0, 1} as follows. 

(0 F(X) < r 

Clearly, 

F(X)= f F r (X)dr. 



Now, observe that if F(X) - F(Y) > then F r (X) - F r (Y) >0Vre [0, 1] and similiarly, if F(X) - F(Y) < 
then F r (X) - F r (Y) < Vr e [0, 1]. Therefore, 



\F(X) - F(Y)\ = 



f (F r (X)-F r (Y))dr = f |F r (X)-F r (Y)|rfr. 

Jo Jo 



Also, observe that if \F(X) - F(Fi)| ^ |F(F) - F(X)| then 

|F r (X> - F r (Fi)| > \F r (Y t ) - F r (X)\ Vr € [0, 1] 
Therefore, 

E(x,F,,...,y rf )~!pmax; \F(X) - F(Y t )\ ^(X,Y u ...j d )~9 max,- ^ \F r (X) - F,-(F,-)| ^r 

Ex,f~„ |F(X) - F(F)| %>y ^ £ | F/ . (X) _ Fr(F) | Jr 

J (E(x ; r 1 ,...,r d )~7>max i |F r (X) - F r (F;)|) Jr 

£(B x>Y ^\F r (X)-F r (Y)\)dr 

. fc(X,Y u ..,Y d )~<P max; |F r (X) - F r (F,)| 
^ nun 

«=[0,1] B X ,Y^ \F r (X) - F r (Y)\ 

Let r' be the value of r which minimizes the expression above. Taking F' to be F r > finishes the proof. 



□ 



Corollary 4.4 (Corollary to Theorem 4.1 and Theorem 4.3). Let F : V(@ax) i_> [0, 1] &<? any function. Then, 
for some absolute constant c, 



> c yjs log d . 



^(x,Y u ...,Y d )~P g ^ max,- \F(X) - F(F,-) 
E x ,y^ |F(X) - F(F)| 

5 Dictatorship Testing Gadget 

In this section we initiate the construction of the dictatorship testing gadget for reduction from SSE . 

Overall, the dictatorship testing gadget is obtained by picking an appropriately chosen constant sized 
Markov-chain H, and considering the product Markov chain H R . Formally, given a Markov chain H, define 
an instance of Balanced Analytic Vertex Expansion with vertices as Vh and the constraints given by the 
following canonical probability distribution over V^ +l . 

- Sample X ~ //#, the stationary distribution of the Markov chain Vh- 
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Sample Y\,. . . ,Yd independently from the neighbours of X in Vh 



For our application, we use a specific Markov chain H on four vertices. Define a Markov chain H on 

j^, P (t\s) = P (t'\s') = j^- £ , P {s\t) = p(s'\n = \ 



V H = {s,t,t',s'} as follows,p(j|j) = p(s'\s') = 1 - ^-, p(t\s) = p(t'\s') = ^-, p(s\t) = p(s'\t') = \ and 



p(t'\i) - p(t\t') - 5. It is easy to see that the stationary distribution of the Markov chain H over Vh is given 
by, 

1 
p H {s) = Ph{s ) = - - e hh(i) = PhQ ) = e 

From this Markov chain, construct a dictatorship testing gadget (V H , f H ) as described above. We begin by 
showing that this dictatorship testing gadget has small vertex separators corresponding to dictator functions. 

Proposition 5.1 (Completeness). For each i e [/?], the i th -dictator set defined as F(x) = 1 ifxt € {s, t) andO 
otherwise satisfies, 



1 
2 
Proof. Clearly, 



Vari[F] ^ - and val^ R (F) < 2e 



E \F(X)-F(Y)\ = l- 

X,Y~hh I 

Observe that for any choice of (X, Y\, . . . , Yd) ~ Ph r > max; \F(X) - F(Yi)\ is non-zero if and only if either 
Xi = t or Xi = t' . Therefore we have, 

E max \F(X) - F(Y t )\ < P[ Xi e [t, t'}]) = 2e , 

{XJ u ...,Y d )~P H i 

which concludes the proof. □ 

5.1 Soundness 

We will show a general soundness claim that holds for dictatorship testing gadgets (V(H R ),P H R ) constructed 
out of arbitrary Markov chains H with a given spectral gap. Towards formally stating the soundness claim, 
we recall some background and notation about polynomials over the product Markov chain H R . 

5.2 Polynomials over H R 

In this section, we recall how functions over the product Markov chain H R can be written as multilinear 
polynomials over the eigenfunctions of H. 

Let eo, e\ , . . . , e n : V(H) — > R be an orthonormal basis of eigenvectors of H and let Ao, ■ . . , X n be the 
corresponding eigenvalues. Here eo = 1 is the constant function whose eigenvalue Aq = 1. Clearly eo, . . . , e n 
form an orthonormal basis for the vector space of functions from V(H) to R. 

It is easy to see that the eigenvectors of the product chain H R are given by products of eo, ■..,e n . 
Specifically, the eigenvectors of H R are indexed by a € [n] R as follows, 



eAx) = J~[ e o-A x i) 



i=\ 



Every function / : H R — > R can be written in this orthonormal basis f(x) = Zo-e[n] R fa-^a-(x). For a 
multi-index cr € [n] R , the function e a is a monomial of degree \a\ = |{/|cr,- + 0}|. 
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For a polynomial Q = Yjo- Qa-^a-, the polynomial Q >p denotes the projection on to degrees higher than p, 
i.e., Q >p - Yjo-,\cr\>p Qo-ea-- The influences of a polynomial Q = Xa- Qo- are defined as, 



lnf,-(G) = J Ql 



o-:tr,^0 

The above notions can be naturally extended to vectors of multilinear polynomials Q = (Qo, Q\ , . . . , Qj). 

Note that every real- valued function on the vertices V(H) of a Markov chain H can be thought of as a 
random variable. For each i > 0, the random variable e,(x) has mean zero and variance 1. The same holds 
for all ea-(x) for all \cr\ ^ 0. For a function Q : V(H R ) — > R (or equivalently a polynomial), Var[<2] denotes 
the variance of the random variable Q(x) for a random x from stationary distribution of H R . It is an easy 
computation to check that this is given by, 

Var[g]= Y, Ql 

<t:|o-|#0 



2l 



We will make use of the following Invariance Principle due to Isaksson and Mossel [IM12]. 

Theorem 5.2 ([IM 12]). LetX = (X\, . . . , X n ) be an independent sequence of ensembles, such that P[X, = x] > 
a > 0, V/, x. Let Q be a d-dimensional multilinear polynomial such that \lss(Qj(X)) < 1, \lax(Q >p ) < 

(1 - £i]) 2p and lnf,(<2/) < t w/We /? = ^ log(l/r)/ log(l/a). Finally, let \p : R^ i-> R /3e Lipschitz, continuous. 
Then, 



;[<A(<2(X))]-E[<K<2(Z))] 



= 0(rra /10 g«) 



w/We Z ij an independent sequence of Gaussian ensembles with the same covariance structure as X. 

5.3 Noise Operator 

We define a noise operator Ti_^ on functions on the Markov chain H as follows : 

ri_F(X) d = (1 - i7)F(Z) + 77 E^ F(7) 

for every function F : H — » R. Similarly, one can define the noise operator Ti_^ on functions over H R . 

Applying the noise operator Ti_^ on a function F, smoothens the function or makes it closer to a low- 
degree polynomial. This resulting function Ti-^F is close to a low-degree polynomial, and therefore is 
amenable to applying an invariance principle. Formally, one can show the following decay of coefficients of 
high degree for Ti-^F. We defer the proof to the Appendix (Lemma C.l). 

Lemma 5.3. (Decay of High degree Coefficients) Let Qj be the multi-linear polynomial representation of 
Y\- ri F(X), and let e be the spectral gap of the Markov chain H. Then, 

Var(e;0 < (1 - snf p 

Furthermore, on applying the noise operator T\- v , the resulting function Ti_^F can have a bounded 
number of influential coordinats as shown by the following lemma. 

Lemma 5.4. (Sum of Influences Lemma) If the spectral gap of a Markov chain is at least e then for any 
function F : V^ — > R 

V InfKT^F) < -Var[F] 

^— ' TIE 

ie[R] I 
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Proof. By suitable normalization, we may assume without loss of generality that Var[F] = 1. If Q denotes 
the multilinear representation of Ti-^F, then the sum of influences can be written as, 

£ InfKIV.F) < Yj lcrl & 

ie[R] |tr|*0 

< 2 k|(l - rjs^Fl 
|cr|#0 



<( w *a-«.)»)£/i<s 



where we used the fact that the function h(t) = t(l-ne) 2t achieves its maximum value at t = -I ln(l -rjs). n 

5.4 Soundness Claim 

Now we are ready to formally state our soundness claim for a dictatorship test gadget constructed out of a 
Markov chain. 

Proposition 5.5 (Soundness). For all s,rj,a,T > the following holds. Let H be a finite Markov-chain 
with a spectral gap of at least s, and the probability of every state under stationary distribution is ^ a. Let 
F : V(H R ) h-> {0, 1} be a function such that max, e [^] Inf^r^F) < t. Then we have 

E [max \F(Y t ) - F(X)\] > Q( Velogd) E \F(X) - F(Y)\ - 0(n) - t^ 1 lo ^ l / a » 

(X,Y u ...,Y d )~'P H R i x,y~ MhR 

For the sake of brevity, we define soundness(V(// s ), fu R ) to be the following : 
Definition 5.6. 

soundnessCVC^),^^) - mm — — — — — 

F:max, em \n\,(F)^r ^X,Y~ MhR ) \F(X) - F(Y)\ 

In the rest of the section, we will present a proof of Proposition 5.5. First, we construct gaussian random 
variables with moments matching the eigenvectors of the chain H. 

Gaussian Ensembles. Let Q = (Qq, Q\,..., Qd) be the multi-linear polynomial representation of the 
vector-valued function \Y\- ri F(X),T\~ ri F{Y\), . . . ,ri_^F(y^)j. Let E denote the ensemble of nd random 
variables (e (X), e x (X), ..., e n (X)), (e <7i), . . . , e n {Y x % ..., (e (Y d ), ..., e n (Y d )). Let E u ...,E R be R inde- 
pendent copies of the ensemble E. Clearly, the polynomial Q can be thought of as a polynomial over 
Ei,..., Er. For each random variable x in E\ , . . . , Er and a value /3 in its support, P [x = (3\ is at least the 
minimum probability of a vertex in H under its stationary distribution. 

This polynomial Q satisfies the requirements of Theorem 5.2 because on the one hand, the influences 
of F are < r and on the other by Lemma 5.3, \far(Q >p ) < (1 - en) lp . Now we will apply the invariance 
principle to relate the soundness to the corresponding quantity on the gaussian graph, and then appeal to the 
isoperimetric result on the Gaussian graph (Theorem 4.1). 

The invariance principle translates the polynomial (Qo(X), <2i(Ti), ■ • ■ Qdi^d)) on the sequence of inde- 
pendent ensembles E\,..., Er, to a polynomial on a corresponding sequence of gaussian ensembles with the 
same moments up to degree two. 
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Consider the ensemble E. For each i t 0, the expectation E[e,-(X)] = E[e,(Fi) = 0] = . . . E[e,(Frf)] = 0. 
For each i + j, it is easy to see that, B[ei(X)ej(X)] = Eh?,-(Fi)e y -(Ti)] = . . . B[ei(Y d )ej(Y d ] = 0. Moreover, 
H[ei(X)ej(Y a )] = E[e,(F a )e ; (Fi)] = whenever i + j and alia, b e {1, . . . d}. The only non-trivial correlations 
are E[e r (X)e,(F a )] and E[e,(F a )e,(Ff,)] for all i e [n] and a, b € [d]. It is easy to check that 

E[ ei (X) ei (Y a )] = At B[ei(Y a )ei(Y b )] = A] 

From the above discussion, we see that the following gaussian ensemble z, = {zx,Zy { , ■ ■ ■ ,Zy d ) has the same 
covariance as the ensemble E. 

1. Sample zx and n-dimensional Gaussian random vector. 

2. Sample zy l; . . . ,zy d £ R" i.i.d as follows : The i** coordinate of each zy a is sampled from A{Zx(i) + 
J I - Ajgaj where g a j is a Gaussian random variable independent of zx and all other £ a j. 

Let Zx, Zy l ,.. ., Zy d e R" s be the ensemble obtained by 7? independent samples from z,x, Zj x ,. ■ ■ ,Zy d - 

Let S denote the nR x «F diagonal matrix whose entries are 1 - /I 2 , . . . , 1 - /I 2 repeated 7? times. Since 
the spectral gap of H is s, we have that 1 - i 2 ^ 2s - s 2 > s for all i € { 1 , . . . , n). Therefore, we have £ > sL 

Proof of soundness. Now we return to the proof of the main soundness claim for the dictatorship testing 
gadget (V(H R ), V^r) constructed out an arbitrary Markov chain. 

Proof of Proposition 5.5. Let Q = (Qq, Q\, . . . , Qj) be the multi-linear polynomial representation of the 
vector-valued function m^FfX), T^F^i), . . . , IV^F^))- 
Define a function s : R — > R as follows 



s{x) 



if x < 

x if* e [0,1] 

1 if x > 1 



Define a function *P : R rf+1 —> R as, ^(x, y\, . . . , yi) = max, \s(yt) - s(x)\. Clearly, *P is a Lipshitz function 
with a constant of 1. 

Using the fact that F is bounded in [0, 1], 

E max|F(X)-F(F a )|> E msx\T 1 . v F(X)-T l . v F(Y a )\-2r ] (5.1) 

(XJn-JdhPifi a (X,Y U -,Yd)~P H R a 

Furthermore, since Ti-^F is also bounded in [0, 1], we have siT^F) = Yi-^F. Therefore, 

E max|ri_,F0O-ri_F(y fl )|= E maxlsir^FiXti-sir^FiY^l (5.2) 

(X,y u ...,Y d )~>P HR a ' ' (X,Y!,...,Y d )~P HR a I V ' V )\ 

Apply the invariance principle to the polynomial Q = (r^F, IV^F, . . . , IV^F) and Lipshitz function f. By 
invariance principle Theorem 5.2, we get 

E max Ur^FiX)) - s (r^JTO)! 

> E m^\s{Y x . n F{Zx))-s{Y x ^F{Zy c )\-^^ x ^ x ^ (5.3) 
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Observe that s o (Ti-^F) is bounded in [0, 1] even over the gaussian space. Hence, by using the isoperimetric 
result on gaussian graphs (Corollary 4.4), we know that 

E max Is (T^FiZxJ) - s (Y^F{Z Y j)\ > c ^elogd E \s (T X - v F{Z x j) - s (t { ^F{Z y ))\5A) 

Now we apply the invariance principle on the polynomial (T\- tl F, Y\- n F) and the functional *F : R 2 — > R 
given by ¥(«, b) - \s(a) - s(b)\. This yields, 

Zx,Zy~[lg AJ . I v /v 'I I v /v 'I 

Over // R , the function Ti-^F is bounded in [0, 1], which implies that s^i^FiX)) = Ti- V F(X) and 
r Utl F(X) > F(X) - r,. 

E Isir^FiXJj-sir^FiYJjl >B x ^ kh r ) \F(X)-F(Y)\-2t] (5.6) 

From equations (5.1) to (5.6) we get, 

E max \F(X) - F(Y a )\ > Q( Velop) E x y _ (tf!) \F(X) - F(Y)\ -4tj- -fi^il lo ^ 1/a » 

(X,Y 1 ,...,Y d )~P B g a ' 



6 Hardness Reduction from SSE 

In this section we will present a reduction from Small-Set Expansion problem to Balanced Analytic Vertex 
Expansion problem. 

Let G = (V, E) be an instance of Small-Set Expansion (y, 8, M). Starting with the instance G = (V, E) of 
Small-Set Expansion^, 6, M), our reduction produces an instance CV',P') of Balanced Analytic Vertex 
Expansion. 

To describe our reduction, let us fix some notation. For a set A, let A (R! denote the set of all multisets with 
R elements from A. Let G n - (1 - rf)G + rjKy where Ky denotes the complete graph on the set of vertices V. 
For an integer R, define G® R to be the product graph G® R = (Gr,) R . 

Define a Markov chain H on V H = {s, t, t' , s'} as follows,/?^.?) = p{s'\s') = 1 - jA^, p(t\s) - p(t'\s') = 
jzt, p(s\i) = p{s'\t') = i and p{t'\t) = p{t\t') - |. It is easy to see that the stationary distribution of the 
Markov chain H over V# is given by, 

1 
Hh{s) = 1*h(s ) = - - £ jUff(0 = p.H(t ) = e 

The reduction consists of two steps. First, we construct an "unfolded" instance (*V, P) of the Balanced 
Analytic Vertex Expansion, then we merge vertices of CV,P) to create the final output instance {'V' ,P'). 
The details of the reduction are presented below. 
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Reduction 

Input: A graph G = (V, E) - an instance of Small-Set Expansion^, 8, M). 

Parameters: R=\,e 

Unfolded instance (<V,P) 

Set *V = (V X Vh) R - The probability distribution p on 'V is given by (jxy x //#)*. The probability distribution 

P is given by the following sampling procedure. 

1. Sample a random vertex A e V R . 

2. Sample d + 1 random neighbors B,C\,. . . ,Cd ~ G® R (A) of the vertex A in the tensor-product graph 

3. Sample ieV* from the product distribution p R . 

4. Independently sample d neighbours y( l \ . . . , if d) of x in the Markov chain H R , i.e., y® ~ pfjix). 

5. Output CLB,x),(Cuyi),...,(C d ,y d )) 

Folded Instance CV',!P') 

Fix *V' = (V X {s, ?}) (S) . Define a projection map II : *V — > *V' as follows: 

n(A,x) = {(a,-,x,)|x,e {.?,?}} 

for each (A,x) = ((ai,x\),(a2,X2), .. . ,(cir,xr)) in (V x {s, ?}) (R) . 

Let p! be the probability distribution on *V obtained by projection of probability distribution // on "V. 
Similarly, the probability distribution P' on ( , V') rf+1 by applying the projection n to the probability 
distribution P. 



Observe that each of the queries U(B, x) and {II(C,-, yd} d i=l are distributed according to p! on r V' . Let 
F' : *y' — > {0, 1} denote the indicator function of a subset for the instance. Let us suppose that 

x ^[\F\X)-F\Y)\\>^ 

Theorem 6.1. (Completeness) Suppose there exists a set S c V such that vol(>S) = 6 and 0(5) < y then 
there exists F' : < V' — > {0, 1} such that, 

E [\F'(X)-F'(Y)\]>±- 
x,y~<v 10 

and, 

max \F'(X) - F'(y,)ll < 2e + <9(?7 + j) 

i J 

Proof. Define F : *V — > {0, 1} as follows: 

fi if|n(A,x)n(Sx{M))l = i 

10 otherwise 
Observe that by definition of F, the value of F(A, x) only depends on II(A x). So the function F naturally 
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E 

X,Y u ...,Yd~P 



defines a map F' : W — > {0, 1}. Therefore we can write, 

P[F(A,jc) = 1]- ^P[iiek(|]P[(fli,...,a R )nS = {a,-}!*,- g {s, t}] 

i€[R] 

1 1 / if -1 1 
>tf 1-- > — 

2 R \ R) 10 

and, 

P [F(A, x) = 1] = P [|II(A, x) n (5 x [s, t})\ = 1] < E [|II(A, i)n(Sx(s, f})|] = /i . I . j^ < I 

(A > jc)~'V Z |v| 2 

The above bounds on P [F(A, x) = 1] along with the fact that F takes values only in {0, 1}, we get that 
E \F'(X)-F'(Y)\= E \F(A,x)-F(B,y)\>±- 

X,Y~<V' ' ' (A,x),(B,!/)~T 10 

Suppose we sample A e V R and B,C\,. . . ,Cd independently from G® R (A). Let us denote A = (ai, . . . , a«), 
5 = (/>i, . . . , & R ), Cj = (en , . . . , c, R ) for all / € [rf]. Note that, 

P[3» e [R] such that \{en,bi}nS\ = 1] < Zie[jq(l - 1/) P [(o*. 6<) e £[5,5]] + 17P [fa, fc) eSxS"] 

< 5(vol(5)(D(5) + 2r]V0\(S)) < 2(y + 77) . 

Similarly, for each j e [<f], 

P[3i € [R]\\{ai,Cji} n5| = l] < J] P[(a,-,c; ; ) e £[5,5]] < /?vol(5)0(5) < 2(y + 77). 

By a union bound, with probability at least 1 - 2(d + l)(y + 77) we have that none of the edges {(a,-, £;)} !6 [R] 
and {(a ; , c/j)}je[rf],i€[«] cross the cut (5,5). 

Conditioned on the above event, we claim that if (B, i)fi(Sx{l,f'|)=fl then max, \F(B, x)-F(C,-, «/,)l - 0. 
First, if (5, x) n (5 x {?,?'}) = then for each /3,- € 5 the corresponding x,- e {s, s'}. In particular, this implies 
that for each b{ e S , either all of the pairs (/>,-, x,), {(c,-,-, g//;)} 7 e[d] are either in 5 x {s, t) or 5 x js', t'), thereby 
ensuring that max r \F(B, x) - F(C{, yi)\ - 0. 

From the above discussion we conclude, 



E 

(B,xUC 1 ,y 1 ),...,(C d ,y d )~<P 



max|F(5,x)-F(C«,j/,)| 



< P [|(fi, x) n (5 x {/, /}) I > 1] + 2(d + l)(y + rj) 

< E [|(fi, x) n (5 x [t, t'\) |] + 2(d + l)(y + 77) 

- R • vol(5) • £ + 2(rf + l)(y + 77) = e + 2(rf + l)(y + 77) 



Let F' : 'V' — > {0, 1} be a subset of the instance (*V , P'). Let us define the following notation. 



vaV(F') d = E 

(X,Y u ...,Y d )~f» 



max |F'(X) - F'(Yi)\ Vari[F'] =* E \F'(X) - F'(Y)\ 



We define the functions F : *V -> [0, 1] and ^, g& : V^ — > [0, 1] for each A e V R as follows. 



Hpf Hef 

F(A,x) = F'(n(A,x)) / A (x) = F(A,x) 



g A {x) A = E F(fi,;c) 

B~G® R (A) 



19 



Lemma 6.2. 



Proof. 



walp'(F')^ E val,R(g A ) 

AeV R ^ H 



\&\p>{F') = valp(F) 



= E E E E max\F(B,x)-F(Ci,yi) 

A~V R x~^ R H y u ...,y d ~ i i R 1 (x)B,C { ,-Xd~G® R (A) > 



^ E E E max 

A~V R x~fi R y,,...,y d ~/i R (x) ' 

^ E E E max \g A {x) - guCj/OI 

A~V R x~n R H y u ...,y d ~n R H {x) ' 
= A f vR y&l ^ A) 



E F(B,x)- E F(Ci,yi) 

B~G® R (A) d~G® R (A) 



Lemma 6.3. 



Proof. 



E E g A {xf > E F 2 {A,x)-val<p,{F') 



E n E g A (xY = E n E d 



A~V«x~^ A~V*jr~/i*fl,C~Gf i (A) 



F(fi, x)F(C, x) 



= - E E E F 2 (B, x) + F 2 (C, x) - (F(B, x) - F(C, x)) 2 



2 A ~V R x~(i R B,C~G® R (A) 

E E F 2 (A, x) - - E E E (F(B, x) - F(C, x)) 2 

2 A~V R x~fi R B,C~G® R (A) 



(6.1) 



where in the last step we used the fact that B, C have the same distribution as A ~ V R . Since the function F is 
bounded in [0,1], we have 



E E E (F(B, x) - F(C, x)) 2 < E E E \F(B, x) - F(C, x)\ 



A~V R x~[i R H B,C~G® R (A) 



A~V R x~fi R B,C~G® R (A) 



(6.2) 



E E E \F(B,x)-F(C,x)\ 

A~V R x~fi R B,C~G® R (A) 

< E E E E \F(B,x)-F(D,y)\ + \F(C,x)-F(D,y)\ 

A~V R x~ti R H y~ti R H (x) B,C,D~G® R (A) 

= 2 E E E E \F(B,x)-F(D,y)\ (because (B,D), (C,D) have same distribution ) 

A~V R x-iif, y~H R H (x) B,D~G® R (A) 

< 2 E E E E max \F(B, x) - F{D t , y t )\ 

A~V R x~ M R y u ...,y d ~ M R (x) B,D u ...,D d ~Gf(A) ' 



= 2val P (F) = 2va\f»(F') 
Equations (6.1), (6.2) and (6.3) yield the desired result. 



(6.3) 

□ 
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Lemma 6.4. 

E Vardg A ]= E E \g A (x) - g A (y)\ > ^(Var^F]) 2 - val^(F') 

A~V R A~V R x,yE/i R 2 

Proof. Since the function g A is bounded in [0, 1] we can write 

E E \g A (x) - g A {y)\ > E E (g A {x) - g A {y)) 2 

A~V R x,yefi R A~V R x,ye^ R 

> E E g 2 A (x)-B E g A (x)g A (y) (6.4) 

In the above expression there are two terms. From Lemma 6.3, we already know that 

E EflJ(jc)> E F 2 {A,x)-val P ,(F') (6.5) 

A~V R xeft R A (A,x)~<V 

Let us expand out the other term in the expression. 

E E g A (x)g A (y) = E E E F'(U(B,x))F'(U(C,y)) (6.6) 

A x,m% A B,C~Gf(A) x,y€(j* 

Now consider the following graph *H on 'V' defined by the following edge sampling procedure. 

- Sample A € V R , and x,y € //* . 

- Sample independently B ~ G® R {A) and C ~ G® R (A) 

- Output the edge U(B, x) and U(C, y) 

Let A denote the second eigenvalue of the adjacency matrix of the graph < H. 
E E E F' (n(B, x))F' (U(C, y)) = {F' , <HF' ) 

A B,C~G® R (A) *,ye/4 

4 2 



< E F'(U(A,x))\ + A[ E (F'(U(A,x))Y-( E F'(U(A,x))y 

\(A,x)~<V J \(A,.t)~^V {A,x)~<V 

= A E F(A, xf + (1 - i)( E F(A, x)f (because F\U{A, x)) = F{A, x)) 

(A,x)~<V (A,x)~ r V 

Using the above inequality with equations (6.4), (6.5), (6.6) we can derive the following, 

E E \g A {x) - g A {y)\ > E E g\(x) - E E g A {x)g A {y) 



A~V R x,yepi R A~V R xefi R A x,yefi R 



>(\-A) 



E F 2 (A,x)-( E F(A,x)) 2 



- val P ,(F') 



(A,x)~<V (A,x)~'V 

> (\ - A)Var[F] - val p/ (F') 

>{l- i)(Vari[F]) 2 - valp,(F') (because Var[F] > Vari[F] 2 for all F) 

To finish the argument, we need to bound the second eigenvalue A for the graph 'H. Here we will present a 
simple argument showing that the second eigenvalue A for the graph fi is strictly less than \. Let us restate 
the procedure to sample edges from 7Y slightly differently. 
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- Define a map M : V x V H ^ (VU X) x (V H U {X}) as follows, M(b,x) - (b,x) if x e {j,f} and 
A1(6, x) = (J., X) otherwise. Let IT' : ((VU X) x (V//U X)) R ->(Vx {5, ?}) 1R| denote the following map. 

Yl\B',x') = {(b' i ,x' i )\x i e{s,t}} 

- Sample A e V R and x, y e /if, 

- Sample independently B = (b\, . . . ,b R ) ~ G® R (A) and C = (ci, . . . ,c«) ~ G^(A). 

- Let At(B, jc), At(C, */) e ((V U {X}) x (Vh U {X})) r be obtained by applying At to each coordinate of 
(B, x) and (C, «/). 

- Output an edge between (W(M(B, x)), U'(M(C, y))). 

It is easy to see that the above procedure also samples the edges of *H from the same distribution as earlier. 
Note that IT is a projection from ((VU X) x (V#U X)) R to (V x {5, ?}) (Ri . Therefore, the second eigenvalue 
of the graph 'H is upper bounded by the second eigenvalue of the graph on (( VXJ J.) x (Vh U { X})) R defined 
by M(B, x) ~ M(C,y). Let 'Hi denote the graph defined by the edges M(B, x) ~ M(C,y). Observe that 
the coordinates of "Hi are independent, i.e., 'Hi = *H R f° r a g ra ph 'Hj corresponding to each coordinate of 
M(B, x) and M(C, y). Therefore, the second eigenvalue of *Hi is at most the second eigenvalue of 'Hz. The 
Markov chain % on (V U {X}) x (V#U J.) is defined as follows, 

- Sample a e V and two neighbors b ~ G v (a) and c ~ G n (a). 

- Sample x, y e Vh independently from the distribution fig. 

- Output an edge between M(b, x) M(c, y). 

Notice that in the Markov chain 'Hi, for every choice of M(b, x) in (V U {X}) x (V#U X), with probability at 
least I, the other endpoint M(c, y) = (X, X). Therefore, the second eigenvalue of 'Hi is at most i, giving a 
bound of i on the second eigen value of "H. a 

Now we restate a claim from [RST12] that will be useful for our our soundness proof. 

Theorem 6.5. (Restatment of Lemma 6.11 from [RST12]) Let G be a graph with a vertex set V. Let a 
distribution on pairs of tuples (A, B) be defined by A ~ V R , B ~ G® R (A). Let € : V R — > [R] be a labelling 
such that over the choice of random tuples and two random permutations tta, Kb 

P P {n-/ (t(n A (A))) = n B l (t(n B (B)))} > { 

A~V R ,B~Gf 1 R (A)"A,n B K ' 

Then there exists a set S <zV with vol(5) € -^, ^ satisfying ®(S) < 1 - £/16. 

The following lemma asserts that if the graph G is a TVO-instance of Small-Set Expansion (y, 5,M) then 
for almost all A e V R the functions have no influential coordinates. 

Lemma 6.6. Fix 8 = 1/7?. Suppose for all sets S c V with vo\(S) e (S/M, M5) , 0(5) > 1 - y then for all 

T>0, 

P [3/ I Inf.-tTi-,^] > t] < -5-5-r ■ max(l/M,y) 
a~v r L J r J e 2 ?/ 2 
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Proof. For each A e V R , let L A = {i € [R] | M^T^fa) > r/2J and L' A = [i € [R] | lnf,(ri_^ A ) > r). Call a 
vertex A e V R tobe good if L' A + 0. By Lemma 5.4, the sum of influences of Ti- n g A is at most ^-Var[gA] < ~. 
Therefore, the cardinality of L' A is upper bounded by \L' A \ < ^-. Similarly, the cardinality of L A is upper 
bounded by \L A | < ^. 

The lemma asserts that at most a -j^tt " max( 1 /M, y) fraction of vertices are good. For the sake of 
contradiction, assume that F AeV R \l' a * 0J > lOOOmaxCl/M^)/!- 2 ^ 2 ?/ 2 . 

Define a labelling f : V R — > [R] as follows: for each A e V R , with probability j choose a random 
coordinate in L A and with probability !/2, choose a random coordinate in L^. If the sets L A , L' A are empty, 
then we choose a uniformly random coordinate in [/?]. 

Observe that for each A e V R , the function g A is the average over bounded functions /#: V^ — > [0, 1], 
where B ~ G^(A). Fix a vertex A € V R such that U A + § and a coordinate / e L A . In particular, we have that 
Inf/ITi-,,^] > t. Using convexity of influences, this implies that, 

^B~Gf R (A) l n Mri_;/./B] > T. 

Specifically, this implies that for at least a | fraction of the neighbours B ~ G R (A), the influence of the i th 
coordinate on f B is at least ^. Hence, if L' A + then for at least a r/2 fraction of neighbours B ~ G® R (A) we 
have L' A r\L B + 0. 

By definition of the functions f A ,g A , it is clear that for every permutation n : [R] — > [/?], /a(tt(jc)) = 
A(A)(-"c) an d <SU0r(X)) = #;r(A)(*)- Therefore, for every permutation 7r : [7?] — > [/?] and A e V R , 

L A = n~ l {L n(A) ) and L' A = n~ l {L' n(A) ) 

From the above discussion, for every good vertex A e V R , for at least a r/2 fraction of the vertices 
B ~ G® R (A), and every pair of permutations n A ,^B '■ [R] —> [R], we have n A {L' n , A) ) n ^(L^g)) + 0- This 
implies that, 

P P W tffaiCA))) = tt^ 1 (€{n B {B)))\ 

A~V R ,B~G® R (A) n A,nB K > 

> P [L' A * 0] • P [L A ni s # 0|L A * 0] • P [^H^aCA))) = n B \l(n B (B))) \ L' A n L B * ©1 

A-V* B~Gf R (A) L J 



> 16max(VM,y) 



By Theorem 6.5, this implies that there exists a set S <z V with vol(>S) € [^, -^] satisfying 0(5) < 1 - y. A 
contradiction. n 

Finally, we are ready to show the soundness of the reduction. 

Theorem 6.7. (Soundness) For all s, d there exists choice of M and y, r\ such that the following holds. 
Suppose for all sets S cV with vo\(S) e (6/M, MS) , $>(S) > 1 - 77, then for all F' : "V -> [0, 1] such that 
Vari[F'] > i, we have val^(F') > Q( ^elogd) 
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Proof. Fix r] = e. We will choose r to small enough so that the error term in the soundness of dictatorship 
test (Proposition 5.5) is smaller than s. Since the least probability of any vertex in Markov chain H is e, 
setting t = £ { I E would suffice. 

First, we know that if G is a VO-instance of Small-Set Expansion (y, 6, M) then for almost all A eV R , 
the function g& has no influential coordinates. Formally, by Lemma 6.6, we will have 

P \3i | InUr^A] > t] < -^ • max(l/M,y) . 

A~V R L J T J ?7 Z 

For an appropriate choice of M, y, the above inequality implies that for all but an e-fraction of vertices 
A e V R , the function g^ will have no influential coordinates. 

Without loss of generality, we may assume that valp'(F') < y/e log d, else we would be done. Applying 
Lemma 6.4, we get that E AeV « Vari [#a] ~> (Varj [F]) 2 - vaLp'(F') ^ i. This implies that for at least a jL 
fraction of A e V s , Vari[^A] > 1/400. Hence for at least an 1/400 - e fraction of vertices A e V R we have, 

VanhjA] ^ — - and maxlnf,(ri_^ A )) < r 

4UU i 

By appealing to the soundness of the gadget (Proposition 5.5), for every such vertex A e V R , val r (g^) > 
Q( -Ji log d) - 0(e) = Q.( ^Jslogd). Finally, by applying Lemma 6.2, we get the desired conclusion. 

val*»(F')> E val„ii(0'A)>il(VeIogJ) 



7 Reduction from Analytic d-Vertex Expansion to Vertex Expansion 

Theorem 7.1. A c-vs-s hardness for J-Balanced Analytic Vertex Expansion implies a 4 c-v.s-s/1 6 hardness 
for balanced symmetric-vertex expansion on graphs of degree at most D, where D = max { lOOd/s, 2 log(l/c)}. 

At a high level, the proof of Theorem 7.1 has two steps. 

1 . We show that a c-vs-s hardness for Balanced Analytic Vertex Expansion, implies a 2 c-vs-s/4 hardness 
for instances of Balanced Analytic Vertex Expansion having uniform distribution (Proposition 7.2). 

2. We show that a c-vs-s hardness for instances of J-Balanced Analytic Vertex Expansion having 
uniform stationary distribution implies a 2 c-vs-s/2 hardness for balanced symmetric-vertex expansion 
on ©(D)-regular graphs. (Proposition 7.5). 

Proposition 7.2. A c-vs-s hardness for Balanced Analytic Vertex Expansion, implies a 2 c-vs-s/4 hardness 
for instances of Balanced Analytic Vertex Expansion having uniform distribution. 

Proof. Let (V, P) be an instance of Balanced Analytic Vertex Expansion. We construct an instance (V',P') 
as follows. Let T = 2n 2 . We first delete all vertices i from V which have p(i) < l/2n 2 , i.e. V «— 
V\\i e V : p(i) < l/2n 2 \. Note that after this operation, the total weight of the remaining vertices is still at 
least 1-1 /2n and the Balanced Analytic Vertex Expansion can increase or decrease by at most a factor of 2. 
Next for each i, we introduce introduce \p(i)T~\ copies of vertex /. We will call these vertices the cloud for 
vertex i and index them as (i, a) for a € [ji(i)T]. 
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We set the probability mass of each (d + l)-tuple ((/, a), (j\ , b\) . . . , (jd, bd)) as follows : 

P'((i, a), (jubi)..., (j d , b d )) = nt ' Jl 7" Jrf) 

W)T)-n d t=l Ui{j t )T) 

It is easy to see that //(/, a) = l/T for all vertices (/, a) e V . The analytic d- vertex expansion under a 
function F is given by, 

^{{i,aHhM)-,(jdM)~'P' max f Wii,a) ~ F(ji,b t )\ 

^{UaUj,b)~H' WiU a ) ~ F (J> b )\ 

where X = (i, a) and Y{ = (j, b) which are sampled as follows: 

1 . Sample a ( d + l)-tuple (i,ji,..., jd) from P. 

2. Sample a uniformly at random from 1, . . . ,fi(i)T. 

3. Sample be uniformly at random from {1, . . . ,fi(j{)T} for each I € [d]. 

Completeness. Suppose, $( V, f) < c. Let / be the corresponding cut function. The function / : V i-> {0, 1} 
can be trivially extended to a function F : V i-> {0, 1} thereby certifying that <b(V ,V) < 2c. 

Soundness. Suppose ®(V, P) ^ s. Let F : V h-> {0, 1} be any balanced function. By convexity of absolute 
value function we get 



E max\F(i,a) - F(bh,)\> E max 

((i,aUjiM),-Xjj,b d ))~'P' t ' ' (i,ji,...,j d )~P I 



EF(i,a)-EF(j t ,bt) 

a e 



So if we define f(i) = E a F(i, a), the numerator for analytic d- vertex expansion in (V, P) for / is only 
lower than the corresponding numerator for F in {V ,P'). We need to lower bound the denominator, 
^i,j~M 1/(0 _ /(/)!• The requisite lower bound follows from the following two lemmas. 

Lemma 7.3. 

E |/(0 -f(D\> E \F(i,a)-F(j,b)\- E \F(i,a) - F(i,b)\ 

i,j~H (i,a),(j,b)~M' (i,ci),(i,b)~H' 

Proof. The Lemma follows directly from the following two inequalities. 

E \F(i, a) - F(j, b)\ < E \F(i, a) - f(i)\ + E \F(j, b) - f(j)\ + E \f(i) - f(j)\ (Triangle Inequality) 

(i,a),0,b) (i,a) (j,b) i,j 



and 



E \F(i, a) - f(i)\ < E \F(i, a) - F(i, b)\ 

i,a i,a,b 



Lemma 7.4. 



E \F(i, a) - F(i, b)\ < 2 val^(F) = 2 E max \F(i, a) - F{j e , c e )\ 

i,a,b (i,a),(ji,ci),...(jd,c d )~P' I 
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Proof. Sample (i, j\ , . . . , j c i) ~ P. For any neighbour (J, c) of (i, a), (i, b), using the Triangle Inequality we 
have 

\F(i, a) - Fiji, b)\ < \F(i, a) - F(j, c)\ + \F(j, c) - F(i, b)\ 

Therefore, 

Zt \F(i, a) - F(j t , ce)\ + Z € \F(i, b) - F{j e , c ( )\ 



\F(i,a)-F(i,b)\ < 



< max |F(i, a) - F(j { , c e )\ + max \F(i, b) - F(j e , c e )\ 



Taking expectations over the uniformly random choice of a and b from the cloud of i, 
E \F(i, a) - F(i, b)\ < 2 E max \F(i, a) - F{j c , c t )\ 



□ 



Lemma 7.3 and Lemma 7.4 together show that 

E(i a) ( ; b) \F(i, a) - F(j, b)\ 

E 1/(0 - f(j)\ > (,) ' a) ' \ KJlJ1 . 

as long as the value val^>/(F) < Vari[F]/4. Therefore, for any F : V h-» {0, 1}, 

^((i,a),{hM)-,(uM)~'P' max ^ l^fe a ) - F Ue, b{)\ s_ 
fc(i,a),(j,b)~ti' \ F (i> a) - F ti> b )\ " 4 ' 

Theorem 4.3 shows that the minimum value of Balanced Analytic Vertex Expansion is obtained by 
boolean functions. Therefore, 0(V',P') > s/4. □ 

Proposition 7.5. A c-vs-s hardness for instances of c{-Balanced Analytic Vertex Expansion having uniform 
stationary distribution implies a 2 c-vs-sIA hardness for balanced symmetric-vertex expansion on 0(D)- 
regular graphs. Here D > max{100<i/s, 21og(l/c)}. 

Proof. Let (V',P") be an instance of ^-Balanced Analytic Vertex Expansion. We construct a graph G from 
(V',P') as follows : For each vertex X we pick D neighbors by sampling D/d tuples from the marginal 
distribution of f on tuples containing X in the first coordinate. 

Let deg(/) denote the degree of vertex i, i.e. the number of vertices adjacent to vertex i in G. It is easy 
to see that deg(i) > D and E [deg(i)] - 2D V/ € V(G). Let L = {i e V(G)\ deg(/) > AD). Using Hoeffding's 
Inequality, we get a tight concentration for deg(z') around 2D. 

P [deg(i) > AD] < e' D . 

Therefore, E [|L|] < n/e D . We delete these vertices from G, i.e. V(G) <— V(G)\L. With constant probability, 
all remaining vertices will have their degrees in the range [D/2, AD]. Also, the vertex expansion of every set 
will decrease by at most an additive l/e D . 

Completeness. Let 0(V',P') < c and let F ; V t-> {0, 1} be the function corresponding to ®(V',P'). Let 
the set S be the support of the function F. Clearly, the set S is balanced. Therefore, with constant probability, 
we have 

O v (G) < 0^(5) < 0(V',P') + \/e D < 2c . 
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Soundness. Suppose 0(V',P') ^ s. Let F : V \-* {0, 1} be any balanced function. 
Since the max is larger than the average, we get 



D/d 



E max \F(X)-F(Yi)\>—Y E max|F(X) - F(Y ( )\ 

X Yj€N G (X) D -^ (X,Yi,...,Y d )~P i 

By Hoeffding's inequality, we get 



7=1 



P 



E max \F(X) - F(Yi)\ < s/4 

X YieN(X) 



< P 



D/d 



-V E mzx\F(X)-F(Yd\<s/4 

D ^ (X,Y u ...,Y d )~P i 



exp(-n(sD/d) 2 ) 



Here, the last inequality follows from Hoeffding's inequality over the index X. There are at most 2" boolean 
functions on V. Therefore, using a union bound on all those functions we get, 

P [$ V (G) > s/4] >\-2 n exp {-n{sDI df) . 
Since D > d/s, we get that with probability 1 - o(l), <D V (G) > s/4. 



Proof of Theorem 7.1. Theorem 7. 1 follows directly from Proposition 7.2 and Proposition 7.5. 



□ 
□ 



8 Hardness of Vertex Expansion 

We are now ready to prove Theorem 1.3. We restate the Theorem below. 

Theorem 8.1. There exists absolute constants C,/3 such that for every s > 0, it is SSE -hard to distinguish 
between the following two cases for a given graph G with maximum degree d. 



Yes ; There exists a set S c V such that 



No ; For all sets S c V, 



P(S)<s 



<f> y (S) > yjs log d 



Proof From Theorem 6.1 and Theorem 6.7 we get that for an instance of Balanced Analytic Vertex 
Expansion (V, "P), it is SSE -hard to distinguish between the following two cases cases: 



Yes : 



No : 



0(V,!P)<£ 



<£(V,!P)>ci yjelogd 



Now from Theorem 7.1 we get that for a graph G, it is SSE -hard to distinguish between the following 
two cases cases: 
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Yes : 



O v < bal < e 



No : 

<D V ' bal > c 2 V^logJ 

We use a standard reduction from Balanced vertex expansion to vertex expansion. For the sake of 
completeness we give a proof of this reduction in Lemma B.2. Using this reduction, we get that for a graph 
G, it is SSE -hard to distinguish between the following two cases cases: 



Yes : 



O v < £ 



No : 

<D V > c 2 yjelogd 

Finally, using the computational equivalence of vertex expansion and symmetric vertex expansion 
(Theorem A.l), we get that for a graph G, it is SSE -hard to distinguish between the following two cases 
cases: 



Yes : 



No : 



b^s 



4> V > c 2 ^elogd 



This completes the proof of the theorem. 



9 An Optimal Algorithm for vertex expansion 

In this section we give a simple polynomial time algorithm which outputs a set S whose vertex expansion is 
at most 0{ ^JOPT\ogd). We restate Theorem 1.2. 

Theorem 9.1. There exists a polynomial time algorithm which given a graph G = (V, E) having vertex 
degrees at most d, outputs a set S c V, such that V (>S) = 0( ^JOPTlogd). 

For an undirected graph G, Bobkov et al. [BHTOO] define Aco as follows. 

, def . ItiTOZZj^iiXi - Xj) 2 

/loo = mm ■ 



Z,-*?-i(Z,-*) 2 

They also prove the following Theorem. 
Theorem 9.2 ([BHTOO]). For any unweighted, undirected graph G, we have 

— < (p < y2Aoo 
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Consider the following SDP relaxation of A 



SDP 9.3. 


SDPval = min V a, 
subject to: 






\\vj - Vi\\ < ai 


Vi e V and V; ~ i 




V INI 2 - - 


i 


= 1 





It's easy to see that this is a relaxation for Aoo- We present a simple randomized rounding of this SDP 
which, with constant probability, outputs a set with vertex expansion at most C -y/OPTlogJ for some absolute 
constant C. 



Algorithm 9.4. 


- Inpu 

- Out]. 


f; AgraphG = (V,£) 


>m? ; A set S with vertex expansion at most 72 ^SDPval log d (with constant probability). 


1. 


Compute graph G' as in Theorem A. 1, let n = \V(G')\. 


2. 


Solve SDP 9.3 for graph G'. 


3. 


Pick a random Gaussian vector g ~ N(0, 1)". 


4. 


def 
For each i € [n], define xi = (vu g). 


5. 


Sort the x r 's in decreasing order xi l > X{ 2 > . . . ^„. Let 5j denote the set of the first j vertices 




appearing in the sorted order. Let I be the index such that 




I = argmin 1<y<n/2 O v (5y) . 


6. 


Output the set corresponding to 5/ in G. 



We first prove a technical lemma which shows that we can a recover a a set with small vertex expansion 
from a good linear-ordering (Step 3 in Algorithm 9.4). 

Lemma 9.5. For any U\,y%,., .,y„ e R + U {Oj, let Y = [yiy 2 . . .y„] T and a = ' ma ^~' lJl u ' , Then 
35 c supp(F) such that (p (S) < a. Morover, such a set can be computed in polynomial time. 

Proof. W.l.o.g we may assume that y\ > j/2 ~> • • • ^ £/» ~> 0. Then 

2 ( max^,j<;(to-i/0 



and 



< a (9.1) 



^ < a (9.2) 



29 



def def 

Let / max = argmaXji/,- > 0, i.e. / max be the largest index such that i/, max > 0. Let S; = {yi, . . 



Suppose Vz < z max N V (S,-) > a\S{\. 
Now, from Inequality 9.2, 



Zt mzx HJ<i ( yj - yd Li maxj~u<i Z^~ (yi - Vm) ZiiUi ~ y*i)W,-)l E/(j// - J/h-i)|S*I 

a > = = = - = > a a 

LiVi LiVi ZjJ/i LiVi 

Thus we get a > a which is a contradition. Therefore, Hi < z' max such that V (>S,) < a. a 

Next we show a /too -like bound for the xfs. 

Lemma 9.6. Let x\, . . . ,x n be as defined in Algorithm 9.4. Then, with constant probability, we have 

Zi max H (xi - xj) 2 



lui x { n \lui x i) 



< 12 SDPval log d. 



Proof. We will make use of the following fact that is part of the folkore about Gaussian random variables. 
For the sake of completeness, we prove this Fact in Appendix B (Fact B.3). 

Fact 9.7. Let Y\,Y2,...,Ydbed normal random variables with mean and variance at most cr 2 . Let Y be 

def 
the random variable defined as Y - max {F,|/ € [d]}. Then 



E[Y] <2o-^logd 



Now using this fact we get, 



max(;ic ; - - xjY 



]-' 



= E 



max(u,- - Vj, g) 1 



/-< 



< 2 max \\Vj - vAl logd. 



j~' 



Therefore, E £,- max^,(x ; - Xj) 1 < 2 SDPval log d. Using Markov's Inequality we get 

V max(;t; - xj) 2 < 6 SDPval log d 

i 

For the denominator, using linearity of expectation, we get 



P 



1 



But since each term in the denominator is a Gaussian random variable, we get that 

2 
Using a union bound on the two events, we get that 

1 



2>H(l>H 



y.- max ;«.,-(*i - xi) 2 

— - J \ Kl — g- < 12 SDPval log d 



JLt^-iOLiXiY 



n 
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We are now ready to complete the proof of Theorem 1.2. 

Proof of Theorem 1.2. Let the xfs be as defined in Algorithm 9.4. W.l.o.g, we may assume that 1 |supp(x + )| < 
|supp(x~)|. For each i e [n], we define j/,- = x*. 

Lemma 9.6 shows that with constant probability we have 

y,max,^,(x,- - X;) 2 

— — } -L < 12 SDPvallog d. 



2ui x j n \2ui x i) 



We need to show that 



Z,max yW 



2 2 



/£i -max^i - Xj) 2 

This fact is proved in [BHTOO]. For the sake of completeness, we give a proof of this fact in Appendix B 
(Lemma B.l). Using Lemma 9.6, we get 



Z/max^ 



2 2 



<72VSDPvallogd. 



Z; iff - „ (Z; »i) 

From Lemma 9.5 we get that the set output in Step 3 of Algorithm 9.4 has vertex expansion at most 
72VSDPvallogd. □ 
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A Reduction between Vertex Expansion and Symmetric Vertex Expansion 

In this section we show that the computation of the vertex expansion is essentially equivalent to the computa- 
tion of symmetric vertex expansion. Formally, we prove the following theorem. 

Theorem A.l. Given a graph G, there exists a graph G' such that max, g y(G) deg(/) = max ig y(G') deg(/) and 
V (G) = @(® V (G'))- Moreover, such a G' can be computed in time polynomial in the size ofG 

Proof. Given graph G, we construct G' as follows. We start with V(G') = V(G) U E(G), i.e., G' has a vertex 
for each vertex in G and for each edge in G. For each edge {u, v] e E(G), we add edges {u, {u, v}} and {v, {u, v}} 
in G' . For a vertex i e V(G) n V(G'), we set its weight to be w(i). For a vertex [u, v) e E(G) n V(G'), we set 
its weight to be min [w(u)/ deg(w), w(v)/ deg(u)}. 

It is easy to see that G' can be computed in time polynomial in the size of G, and that max i6 y(G) deg(i) = 
max ;ey(G /) deg(i). 
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We first show that V (G) > O v (G')/2. Let 5 c V(G) be the set having the least vertex expansion in G. 
Let 

S' = S U {{u, v} | {«, v} € E(G) and u e 5 or ye 5} . 

By construction, we have w(5) < w(S'), Ng(S) = N G >(S') and 

w(w) 



w(N G ,(S'))< V teg(u)-±/-<w(N G '(S')). 



Therefore, 



<D (G ) < O g ,(5 ) = — < — — = 2(p G (S) = 2cp (G) . 

w(S') w(S) 

Now, let S' c V(G') be the set having the least value of 3> G ,(5') and let e = <D G ,(S')- We construct the 
set S as follows. We let S i = S'\N G >(S'), i.e. we obtain S i from 5" by deleting it's internal boundary. Next 
we set S = 5 1 n V(G). More formally, we let S be the following set. 

S =[veS'nV(G)\v<tN G '(S')} . 

By construction, we get that N G (S) £ N G >(S') U N G >(S'). Now, the internal boundary of 5" has weight at 
most sw(S'). Therefore, we have 

w(S 1 )>(\-s)w(S'). 

We need a lower bound on the weight of the set S we constructed. To this end, we make the following 
observation. For each vertex {u, v) e S i n E(G), u or v also has to be in S \ (If not, then deleting {u, v) from 
5 ' will result in a decrease in the vertex expansion thereby contradicting the optimality of the choice of the 
set 5")- Therefore, we have the following 

Z\TH f W(u) W(u) ] v-1 

w({u,v})= ) mm ,, K V w(u) = w(S) . 

t—L deg(w) deg(w) ^— ' 

Therefore, 



">GS) > — ^ — > (1 - e)— - — 



Therefore, we have 



^ V 7 ^ GV ' w(S) (\-e)w(S')/2 GK ' V ' 



Putting these two together, we have 

< O v (G') < 40 V (G) . 



n 
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B Omitted Proofs 

def 

Lemma B.l ([BHTOO]). Letz\,...,z n £R and let x t = zf. Then 



E.-max/-.,- 



2 2 

x 2 -x 2 



<6 Zirmx^iizi-Zj) 2 



Proof. W.l.o.g we may assume that |supp(Z + )| = |supp(Z~)| = fn/21 and that z\ > Z2 > • ■ ■ > Z n . 

Note that for any i e [n], we have max ; ^, i&7<! (z| - zp 2 + n®Xj~i,&j>i(z] - zj) 2 < 2 maxy^Kzy - zt) 2 . Now, 

E; max^Kz; - zt) 2 Zi max j<i&H (z-} - zt) 2 + E; max ><&iH (zJ - zp 2 



^"' z r 2 (Efesupp(Z+) Z; + Ejesupp(Z-) Zj ) 

'E, maxy<,& H (zt - z+) 2 Ei max j>i&H {z'] - zp 2 



^ mm- , , 

{ 2 E;'esupp(Z+) z t 2Eiesupp(Z-)Z, 

W.l.o.g we may assume that 

E; maxy<, & y^(z} - z r + ) 2 E; mzx j>i& j~i(zj - zj) 2 



Eiesupp(Z+)Z,- Ei£Supp(Z-)Z; 

E; max iW (xy - x,-) 2 E< max iW (zy - Zt) 2 
^ z- 



E,-* 2 E,z 2 

We have 



max (x 2 - x 2 ) = max (xj - x ( -)(xy + xf) 



< max ((Xy - x,) 2 + 2x,(x; - X;)) 

j~Uj<i v ' 

< max (xj - xi) 2 + 2xi max (xy - x,) 
HJ<i HJ<i 



Zmax(x; - x,) 2 + 2 |> x 2 /max(x; - x,) 2 Cauchy-Schwarz 

. j~u<i \t yj-^ 

i M W-+2 ^ t pl x y. x 2 



Thus we have 



E,- rmx HJ<i (x^. - xj) K\ maxy. ; (zy - z«) 2 

T^ < 6 A " T 



Lemma B.2. A c-vs-s hardness for b-Balanced-vertex expansion implies a 2 c-vs-s/2 hardness for vertex 
expansion. 

Proof. Fix a graph G = (V,E). 
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Completeness. If G has Balanced- vertex expansion at most c, then clearly its vertex expansion is also at 
most c. 

Soundness. Suppose we have a polynomial time algorithm that outputs a set S having (f> y (S) < s whenever 
G has a set 5" having V (5") < 2c. Then this algorithm can be used as an oracle to find a balanced set of 
vertex expansion at most s. This would contradict the hardness of Balanced- vertex expansion. 

First we find a set, say T, having (p^{T) < s. If we are unable to find such a T, we stop. If we find such a 
set T and T has balance at least b, then we stop. Else, we delete the vertices in T from G and repeat. We 
continue until the number of deleted vertices first exceeds a b/2 fraction of the vertices. 

If the process deletes less than b/2 fraction of the vertices, then the remaining graph (which has at least 
(1 - b/2)n vertices) has conductance 2c, and thus in the original graph every ^-balanced cut has conductance 
at least c. This is a contradiction ! 

If the process deletes between b/2 and 1/2 of the nodes, then the union of the deleted sets gives a set T' 
with v (r') < s and balance of T at least b/2. 

Therefore, starting with b = 2b, we arrive at a contradiction. □ 

Fact B.3. Let Y\,Y2,...,Ydbed standard normal random variables. Let Y be the random variable defined 

def 

as Y - max{F,-|z e [d]}. Then 

E [Y] < 2 ^f^ogd 

Proof. For any Z\, . . . ,Zj e R and any p e Z + , we have max, |Z,-| < (Z,-Z?)p. Now Y 2 = (max,- X,) 2 < 
max,- Xf. 



E [y 2 ] < E 



< E 



(p)\2P 



EH*?]) (2 " } ' " 



E^ : 



2p 



( Jensen's Inequality ) 



< 2pdp (using (2p)\/p\ < (2p) p ) 



Picking p = log d gives E I Y 2 1 < 2e log d. 
Therefore E [Y] < V E [ y2 ] < ^2e\ogd. 



C Noise Operators 

Let H be a Markov chain and let F : V(H k ) i-> {0,1} be any boolean function. In this section we prove some 
basic properties of Yi-^F. We restate the definition of our Noise Operator Ti- V . 

ri_„F(X) - (1 - i/)F(X) + 7/ E X F(Y) 

The Fourier expansion of the function F is F = ^o- /o-^o- where {go-} is the set of eigenvectors of H k . It is 
easy to see that e 



-cr. 



) ea- k , where the {go-.j are the eigenvectors of H. 



35 



Lemma C.l. (Decay of High degree Coefficients) Let Qj be the multi-linear polynomial representation of 
\Ti- v F(X) - ri_„F(y;)|. Then, 

Var(ef) < (1 - e??) 2p 



Proof. 



T X - n F(X) = (1 - rj)F(X) + r, E F(Y) 



= T,f* 



E 



Y~X 

e a (X) + Y ^ X F{Y) 



= Yf (T n i& J{\-j ] )e (Ti {X i )+ E ^..(T;)) 

~f \ Y i~ X i I 



We bound the second moment of Ti^F as follows 



E (T^FiX)) 2 = £ % E fW |(1 - 77)^(^) + ^ K Xj e ai (Yi)\ 



= V /£lW (1 - 7/) 2 E ^(X;) 2 + ^ 2 E E ^<Y*) + 2/7(1 - 77) E E e^Xde^Yd 

*—* I -a; a, y/j^Af y A, //~A; I 

= J] ^n (6(r ((1 - T/) 2 + n 2 X] + 2/7(1 - 77H) 

= J]^n iecr (i-77 + 77io 2 



Therefore, 



Var(2f) < 4^ ffri^il-n + nM) 2 

a-.\cr\>p 

< 2 ^(1-^) 2H 
cr:\a-\>p 

< (l-e??)^ 



Here the second inequality follows from the fact that all non-trivial eigenvalues of H are at most 1 - e 
and the third inequality follows Parseval's indentity. □ 
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